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Genomic sequence of the mouse villin gene regulatory sequences 

GATCTGGTGC ACCAAGGACA CTGTGGTCCC AGCACTGGGG AGGTGGAGGG AGGAGGGTCA 60 
GAAGTTTAAG GTCATCCTTG GTTACATAGC AAGGTTTCAG CCAGCTTCAG CTACATGAAA 120 
CCTTTGTTTG TTTGTTTGTT TGTTTTAAAG CATTAATAAA TAATACCATA AGGAGGTTGG l80 
CAGTGGTGGC AGACACCTTT AATTCCAGTA TTCAGGAGGC AGAAGCAGGC AGATCTCTGT 240 
GAGTTCGAAG TCAGCCTAGT CTGCAAAGCT AGTTCCAGGA TGGCAAGGGC TACACAQAGA 300 
AACCTTGTCT CATAAAACCA AAGTAGTAGT AGTAGTAGTA ATGCCATAGA GAAAATTGGA 360 
GTCCATTCAG GATGGACCAT CCTATAAGAT GATTCTCTTG ACCCAGGTAA GCTAATGTCA 420 
TGGGGAAAGG GGATGGGACT GTCCTAGATT AAAAAGTGCT GAGGCGATGC CTATTCTCAA 480 
TTTGATTCCA TATGAAAAGG CTGATAAGGC CCAAGAGAAG TGGAACTGGG ACTCTGGACT 540 
GAAGACGTGA CGGCCTTATA AACACTGGCA CTTATAAACA CTTATAAACA CTGGCACAGG 600 
CGTTCAGGTT TGAAGATCAC TTTCAAACCA CAGAACAGAA AGTGCTCGCT CGTCCTCAGC 660 
GTAGCGAGCA CTGGCTGCAG AAGAGTGATA TTTAGTGAAA GCTACCTTCA CAATATCTTT 720 
GCACTTATCA CATACACGTG TCAAATGTGC TAACTCCCTA GTCCACAGAT GGCTGTTACA 780 
CTCGTTTCTG CTTTCCCATC TGGTTGACAT TTGTCAQAAC CAGAAATTAG AAATGTGGGT 840 
ATTTATTTGT GTGCTGAGGA CACCATCCAG GGCTTTTCAC ATTTCAGGCA CATGGTTTAC 900 
TAACTGGGCT ACTTCTCCAA CGGTTTGAAA CCATTTGTTT TATATTTACT TATTTTGTGT 960 
GCATGAGGTA GGCATGTATA CGTATGTATA GGAGTCATGC ATGTGGCTGC TACCCTCAAA 102 0 
ATCATTGCAG ATCCCCAGCA AGTGAAGTCA CCGAGCGTTG TAAGTTGTTA TGTGGGACTG 1080 
GGAGCCAAGG CTGGGTTCTC TGCAAGAGCA GCCAGTGGCC TTAACCATGG GACCAGCTCT 1140 
CTAGGCCTAA GGTAATCTTT AGTTTTTTAA AAATATATAT TCTCAGCCGG GTGTGGTGGC 1200 
ACACGCCTTT AATCCCAGCA CTTGAQAGGC TGAGGTGTAG GAATTATACA CACAGGCCAG 1260 
CTGGGGTGCA GAGCTTGGCC CTGTTTTTTT TGTTTTTTCT TTATGTGCAC TGGTGTCTTA 1320 
CCTGCGTGTA TGTCCGTGCA AGGGTGTCAG ATCCCTTGGA GCTGGAGTTA AAGACAGTTG 1380 
TGATCACGCT GCCGTTACAG ATGCTGGAAA TTGAACCCAG GTGTCCCTAG AGAAGCAGGC 1440 
AGTGCTCTTA ACTTCTGAGC CACCCCTCCA ACCCTGCTTT TAGAGACTCT TAACCTTTTG 1500 
TGTAATGTGG GAACTGAGTG GATCTTGCAC TTACCAAGTG TGTGCTGCGC TGTAGCATCA 1560 
CTGAGCCCGT ACCCACACGA CTAGTGGATA CAGTTTAAGG GCAAACACTT AACAATGACA 1620 
ATAGTTGGAT AGAGTTTGAA TATAGTCCTG AGCTATTGGT TAGCGTGACC TTTGCTGTCC 1680 
TTAGCATGTG CTGTGAGAAG ATAGAAAAAT GAAGACTTGA GTCTAGTCCT GGAACCCACA 1740 
GAGGCAGGCG AGAACCCACT CCTGAAAGTT GTTCTCTGAG CTTCACATAC AACTTCACAT 1800 



FIGURE 6A 
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AATAGTTACA ATGATAATAA TAATTAGTAA ATTCTTTTAA AAGGTATATG TTGGGAGGGA 1860 
GAGATGGCTC AGCTTCCAGG AGCACTTGCT GCTCTTGCAG AGGACCTAQA TTCAGTTCCC 1920 
AGGACTCATA TGGTGGCTCA CAGCCATCTG TAAATCCAGT TCCAGAGGGT TCCACACCCT 1980 
CTTCTGGCCT CCACAGGCAC CACATACATA GTACACAGAC ATACATGCAG GCAAAACACC 2040 
CATACACACA TAAATAAATA AGGAAACTTA AAAGGTGCAT GTGTTGGTAA ACATTGTGCT 2100 
TACACATGCT GATTGAAGAC ATGTACAACG CACACACTGA AGAGGGATCT GGGGCTGGAG 2160 
AGATGGCTCA GCGGTTAAGA GCACTGACTG CTCTTCCGAA GGAAGGTCCT GAGTTCAAAT 2220 
CCTAGCAACC ACATGGTGGC TCACAACCAT CCATAATGAG ATCTGACACC CTCTTCTGGT 2280 
GCATCTGAAG ACAGCTGCAG AGCTACAGTG TACTTAGATA TACTAATAAA TAAATCTTTT 2340 
TTTAAAAAAA TGAAGAGGGA TCTGAGACAC CTCAAAAGAG ATTATGAGCA GTQACTCACG 2400 
GGTGATTATC TATCCTGGAG TTTTTCCTTT CCGCTTGGCT TGCAACTGGG TGGACAGACG 2460 
CCCCTTTTCA TTCACAAGAA CGGGTGCTAC ATTATTTCTG AACAAAACAG CACCTGCAGT 2520 
ATGTTTACTG TCCTTGCTGA CTATGAGCAC GCGCACGCGC GCGCGCACAC ACACACACAC 2580 
ACACACACAC ACACACACAC ACACACACAC ATTCAGTCTC CAGAGCTCTT GGGAAGGTCA 2640 
AGAAGAGGCT GCCCTCAAAC ACGATCTTCA TCTTTCCCTC CTAAAGGAGA CCACGATTCC 2700 
AAGGTGGCAG AAGATCTACA GGGGGCAGAG GCAGGGAGGG GGAAGCAGGC CATGGTTTCC 2760 
AGAGACCTAC AGCAGAGGGC AGCAAGGCAG ATCCCCAGGT CCAGGGCAGG GAGGTGGAGG 2820 
CCCTTGTTCC GAGGAGAAGG CAGGCGGCAG AACAGGGTTC AAAGGCACAG GTTTATGGCA 2 880 
GCTCATAAAA GTGGAGGTCG TGGCTCACTC AGAAAGGAGG AAGAAGGGAA AGGCCCTTGT 2 940 
GCCCACTGAG CGAGGGTCAT GCTGAGTAGG AGAGATCTGC AGGGGTGCCA GGAGCCCCAC 3 000 
CTGTCTGTCC CAAGGQAACC CCAAGTGTGA ACTCTGGCCT TGGGTGCTGA GTTCCAGCTA 3 060 
CAAGACCCCA GGAGTCCTAC TCCATCCCCA TCCAGTGCCC CCTCGCCCCG CCACACCCCA 312 0 
CCCCCGACTC CCGTGCCACT TCTCTAGGGC TGGAGGGTGG CCAGCCCTGG TGGGGGTTGC 3180 
CTACCTGCAG GTAGAGCCCA GGTCCTAGCC GQAAGTGCAC CCCATCCCTG AAGCTGCAGA 3240 
GCCAAGGGCG GGGCACACGG CAGCTCAGGC TGTCAGGCTG TTGCTGGGCT CTAGGTTCCC 3 3 00 
AGGGACCTGG GCACCTACTT CCCCACCCCC CCATCCATTC TCTCTGGGGC CCTATCTTCC 3 3 60 
CTTATATGGT GAAGGAAGTT CCTGGGGGGG GGGGGTGGTG GTGAGGACAA AGGTCGTTCG 3420 
GTCTCCTGCA GCCAGCTTGC C ACAACTTCC TAAGATrrc c CAnoTcyiTry; nnrrrcrrr 3480 
+ 1 exon 1 

(transcription start site) 

CAGACAGG TA AGGCAATTGG GTGGGGACAC ATGGTGACCA CAGGTGGTTG GAGGGGACAG 3 540 
GGTCCTTGCT TCTCTCTGGC AGCCTGTGCT TTCTGTAGCA CCTTGGTATA AGTTTGGGGG 3 600 

FIGURE 6B 
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TGAGGTAAGG TGCTCTGAAA CTCTGAAAGA AGCAAGAAGC CAGCAGGCTG TCTTGGGCCT 3660 
TCAATGAAGG AAGTTCACAG ACCCCCTTTC CTGTAAGTCA CCTTCGCTTC ATCTGTGTAG 3 720 
ATTCCCTGGG ACCAAGGTGG CTCCTGGGAC TCAGATTTCT ACAATTAAAA TCAGGACAGT 3780 
CCTGAGACTT GGACTCCGTG CCTGTATTTA CTACTTCTCT CTGGCTGCTC ATTTCTGTGT 3 84 0 
TCATGTCTTA CACATCTGAA ATGGTTTCTT TGTGTCACCA TTCCCCTGAC ACTCCTGGGA 3 900 
GGTCGTATCC TTGGCACATG TATCCTGGGA TGTAAGCTGC AGCCACCAGG AGAGAGGGGG 3960 
AGAGTCAGGA GCTGTGTCCT AGGCCCTATT AGGCCTGGAC ATCACCCCTT TCCTAGAAAT 4020 
GGCCCCTCCA TTTTTCGGTT ACCATGATCT ATTTTATATC AGAGTGGGCA GTGAAAGCCA 408 0 
AACCTGCCCA GAAGTTTGGG ACTCACTCAG ACCAAGGTTA TCTGCTCAGA AATCCCCCTG 414 0 
TCACTTGAGG TTGGGAGAAT CTGCCTCTGG GGGCTTCCAG GTCTTGGTTA GCAGGAGGGT 4200 
ATCCTTTGTA TAGGGCATGA CCTAGTCTAT GGTGTTACTA CATTCCTGTC CAGTTAAAAG 4260 
CTGGAACTAA AACCCACGGC AGCGCCCAGG ATTCTCTACA GTTGTACCCC AAGAACAACA 4320 
AGACAGTAGA TATGCAAGGA TAGGTAGCTG GGGAGAAGAA GAACTTAAAC CCCCCCAAAG 4380 
GCCCACAGGT TCCGTTCCCT AGTTCACAAT GCCAGTATGA GTGCTAGCTA CTATGGGCTG 4440 
TGAGTTGGTA GCTACAAGCA TGAGTGATGT TCATGTGTGT AGTGTGTATA ATCTGAGCAC 4500 
TTGGQAGGCT GAAGCAGGAG GATTGCTATA TGTTTGAGGC CAGCCTGAGC TATAGAGCX3A 4 560 
GACTTTGTCT TTAAGAAAAA AATGAAAGCC CAGCAGTGGT GGCACACX3CC TTTAATCCCA 4620 
GCACTTGGGA GGCAGAAGCA GGCAGATTTC TGAGTTCAAG GCCAGCCTGG TCTATAGAGT 468 0 
GAGTTCCAGG ACAGCCAGGG CTACACAGAG AAACCCTGTT TTGAAAAACC AGAAAAACAA 4740 
AACAAAACAA AACAAAACAA AACCCAAACC CAAACCCAAA CCTCTCATCT CTCATCTCTC 4800 
TAGGCTGTGT CTGTCTAGGT GGTAGAGTTT GGGGACTTCA GACTTATATA TAAATAGGCC 4860 
TTTTTATCAC TGGTCAGAGA CGAGAAAGGT TTCAGTCTGG GACACAGTGG GACCCTGAGA 4920 
AAGTACTCCT TGCCAGCCCA AAAATTCTGG GAAGGCTTCC TGGAGGAAGT GTGTCCCGAT 498 0 
CAGACTACTG TTCTAGAAGG CAGAAGAGAG GGTTGGAAGA ATGTTGGTGG ACAGACAGTT 504 0 
GGAACAGAAG GACAGGAGGG GGAGGCATCC AAGATTCTGA ACATGTAGCT GACTTTTGGT 510 0 
TCTCTGGGTG ACAAGTGTCC CCCAGGGATA GGGCTGTAGA AAGGGGACCA GGGGTGAGCC 516 0 
AATGAGTTCA AGTTGAGGGA CACATCCAGC CCAGGGTCCT TGCTGGCAAG CTAAAGAATG 522 0 
AGAGCCCTCT AACCCTCCCT GAAGTTTAGG GGAGACAGGA GAGCTGAGGA GATCCTTCTA 528 0 
GGGTGAAGGA GAGGTATCTG CTCTGACCAA CATGGCTAGG AGCAGAAGCA GTTGGACCAG 534 0 
TTACCCCTCA GAACCAGCCA TCCCCTCTTG GCTCTAAGGA GGCTGGGCCC CTTTCTGTTT 54 0 0 
AAGAATCTTA CTTTTCTTCA GAGAGAGGCA GCAAGCCTTT GTCCCCTCCC TGTTGGTCAA 5460 
TAAACACCCC TGTGTGTAAC ATTAGTTTAT TTTACTGTCA GTTTGCTCCA GGACAGTCCA 552 0 
FIGURE 6C 
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TCTGGTAGAC CTCTGCTCCT AACTCACCAA GGTATGGCCC ACATTCCTCA CCCAGAAGAG 5580 
TGCAGAAGAG AGCCTTAGAG AAAGGGTAAC AGTAACAAAG ATGGCCAGAA TAAAACAAAA 564 0 
ACTACTATCC TTTGTACCCA AATTGGTTTT GCTGAACCAG GAGGGGGTGT GTGAGTGTAT 57 00 
GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 5760 
CTTGGGGGAC TTTTCATGCT AAAGAATATC TGATATTGGC GCCCATGCCA ACAGGGGTAT 582 0 
TGGGGAGAGT CAGGCTTCTG CAAACACAGT AAGCTGCCCA AGATGGATTG GTGGCCTGAA 58 8 0 
TCACCAAGGG GCAGGCTGAT CAGAGTGGAC AGAACATCAC AAGATAAGCC ACCCTGTGGG 5 94 0 
GCTCAGAAQA GGGAGTTTAC AAGAGGTAAA GGCCAAGCCA TTTATTATCC AAGACATGAC 6000 
TCAAAATCAA AGTGCAAGGA GAGATTAGCT GGAGAGATGG GGCTGTCAGT GTGGGACACC 6060 
TGACCTTGCA CTTATTAGTC ACTAGGCCAA GGAGCAGTCA CAGAGGGTGA CTGGGTCCTA 6120 
CTCAGCTTGG AGCAGGCACG TGGAGAATGG GTGACCTCCA TCCTGATGGA GAGGGCTGAG 6180 
CACCACCAGG TACAAGTGTT CCCTGTGTCT CATGCCAGGA TTCCTGGCCA GTTTTCAAAG ' -624 0 
GACTAAGGAC TCATCTCTGG TGGAAACAAA GTATCCAAGC CCTAAGCCCC ATTTTGGTCT 63 0 0 
AATTAAATCA GAACCCCTGG GGATGCAGGC TCTGAGCAGC AGGAGCTTTT TAAAAAGCTC 6360 
CCAGGTGATT CTGATCAGCA GCTGGAACAA ACACAGCTAC AGGTTCAAAC AGAAAGAGGC 642 0 
AAAGCTAGGG AAAGCTTGGG ATGGGGAGCC TTCTTCCAGG CCAGTAGATG GAGGCTGGTT 648 0 
AGCAGTGGTG GCAGCTTCTC TCTGCCTGTC ATATAGCTAT CCATCCACTC ATCCATCCAT 6540 
ACACCCACCC ATCCATTTAT GCACCCATCC TTCCATCCAT CCATCTATCC AGCTACCCAC 6600 
CCACGCATCC ATCCAAACCT TCCTTTTCTC CTTCTTTCTT TCTTTTTTCC TTCACTCATT 6660 
CATTTATCCA ACAGAGAACT GGTATTGTAC TAAATGTGGG AGATTTAATT AATTTTTAGA 6720 
AGCTCTGTTG ATTGACTGAT TGTGCATGTA TGTGGACAGG TACATACCAC AGCACACGTG 6780 
TGGCAATCGG AGAAAGGTTT TGGGTGTTGT TTTCTCTTCC CACCGTGTGG GTTCTGGGGA 6840 
TTGAACTCAA ATTATCGGGC TGGTGGCAAG TGTCTTTACC ACCGAGCCAT TTTGCTGACA 6900 
CATCATTATT ATTAGAAAGC ATCTTATGTA GTCCAGGCTG GCCTCAAGCT TGCTATGTCG 6960 
CCACGGATGA CCTTTAACTC CTGCTCTTCC AGCCTCCACC CGAGTGCTAG GTTTACAGGT 7020 
GTTCAACTGG TGAATGCCTT TAATCCCAGC ACTCTGTGGG GGGGGGGGGG GAGGCGGATC 7080 
CCTGAGTTGG AGGCCAGTTT GGTCTACAGA GTTTCAGGAT ACCTGGGGCT ATACAGGGAA 7140. 
ACCCTATCCC AAACAAACAA ACAAACAAAC AAAAAATATT CTGTGCAATA ATCACAGAGA 7200 
TTAGAGGATA TTAGTAGGGT AGTAGGGCTG GTGAGGGAGA GTCATGCTTT CTTTTGTATT 7260 
ATAATAGTAA AGTACTCACA AGATGCATTA TCTATCTATC TATCTATCTA TCTATCTATC 7320 
TATCTATCTA TCTACCTACC TACCTACCTA TCCATCCATC CATCTATCGT ATAGCCCAGG 73 8 0 
CTGCTTTGAC TCTGAATGCT CCTATTTCTC GGTCAACTCT TCACCCCTAG TGTTGGGTTT 744 0 
FIGURE 6D 
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ACCAACACCC AGACATTTAT TTTATTTTGT TTTATTTTAT TAATCTAGGA GCTCAGGGTG 7500 
GGACTCAGGG TCTTGTGCAT GCTAAGCAAG CTCTCTGCCA CAGAGCTGCA GCTCCAGTCC 7560 
CCATTTTGTT CAGGTGACTC TGTGACAGTT GTCATATTCG CAGCGCTATG TAGCTCTCTC 7620 
CACCTCCCAG TTCCAGCACT TTCTGGTCAT CCCAGTGGGC GGGCAACTCT GTGCTCACCA 768 0 
GTGCCCTGTT CCCTGTCTTC AGACCTACAT ATTTGCCTGT CTGAACAGTT CATGTAAATG 7740 
GGATGCGTTC CTGTGTATTC TTTTATGGCT GGCCCCTTTA TCTTAGCACA GTTTGTGTTG 7800 
GGCCATGTGT CACTGCTATA CTCTATCTTA TCATCATCTT ATGGCTTAAT AGTGTTCCTT 7860 
TGTGTGGATA AACCACTTTC TGTTTCATTT ACTGATGGAA ATTTGTGGCC CCACCCCCAC 7920 
CCTTTTTTTT TTTATTTGAG ACAAGGTCTT TCTGTGTAAT CTTGCAATCT TGGCTGTCCT 7980 
GAGCTCACTC TGTAGACCAG GCTGTGAGGC TGTCCTTCCA CTTTTQACAC TCCTGTGAAC 8040 
AGAGTAGCCA TGAACTTCAA AGACAATTTT CTGTTTTGGT TTGTTTTTTA CATTTGTGTG- 810 0 
TGTATGCGTG TATATGTGCA TGTTTGTGTC TTCAGGTGCT CACATGTGTG TACCTGTGTG' 8160 
TGGGACAGAG AACRAACCGA TGTGCCATTC CTCAGATACT ACGCATCTTG TTAATATGTA 8220 
TGTATTATGT ATGTTTATTT AGTGTGCCCA AGTATGCAGG TATTTTGTTG GAGTTTTCAC 8280 
CTTCCCTTGT GGGCTCTCCG CATTAAACTC AGCTCCTCGG GCTAGTGAGC AATGCCTTCA S340 
CTCGATGAGC CATCTCGCTG CCCCTGCTGC CACCTCCTCC TTATTTCCCA GATGGGACTA 8400 
CGCACTGCAC TGGCCTAAAG CTCACCAAGT CATCCAGAGT GGCTAGCCAG GGAGACTCAG 8460 
GGATATGCTG GCCTCTGCCT CCACAGTGCT AGAATTACAG GCATACATCA CTGCTGGAAG 852 0 
ATTTTTAACC TGAATCCTGA GGATAGAGCA GGCACTCTAC CAATGGAGGG TTCTTTTTGT 8580 
GTTTGGTTTG GTTTCCTCTG CATAAGATCA GGCAGTCTGA AATAGTGTAG CCTGGGCTAC 8640 
ATAACATCTT GTCTCAAAAA GCCTATAGAG GTAGGGAGGT CGAGGCTAAA GAAGAGCCTT 8700 
AAGCCGGCTG TGATAGCACA CAGGATAGCC TGCACTATAT AGCAAGACCT TGTTTCAAAA 8760 
ACATGGAGGG AGGGGTATGT TTTAAGTGCT GGGCTGTGTA ACAGGCACTA AGGGAGCCAA 8 820 
TGTAGACATT TGACTAAGAA AGGATCATCA TCAAAGCCGG GTGGGCAGGG TAGAGGTTGG 8 880 
ACTACAGTGG TCAAGACCCC CATAGGAAGC CAGTTTCCCT TCTTCCTCTG GGCCTCAAGC 8940 
CTGGCTCGAC GGCCACTGCT CTCACATGCC TTCTCCTCTA G GCTCGTCCA rnA TQ 8 995 

exon 2 



FIGURE 6E 
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FIGURE 8A 
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FIGURE 8B 
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FIGURE 8C 
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